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ABSTRACT 


The replenishment of a dispersed battle group requires logistic ships to travel long distances 
between ships in the battle group. When operational requirements limit the amount of time that can 
be spent conducting replenishment, decision makers must select which ships to replenish based on the 
amount of time needed to transit between ships, and the combat value added to the battle group by 
replenishment. With proper assumptions, this problem is analagous to the Generalized Orienteering 
Problem. A dynamic programming algorithm is developed using this approach and tested against a 
set of test problems. The algorithm is capable of scheduling replenishment using both Delivery Boy, 
or Circuit Rider tactics. The results indicate that the algorithm runs quickly enough to be useful for 


scheduling underway replenishment in operational situations. 
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Le INTRODUCTION 


A. BATTLE GROUP OPERATIONS 

The ability of the United States Navy to project power 
away from her shores relies heavily on the Carrier Battle 
Group (CVBG). A CVBG consists of an aircraft carrier, her 
air-wing and six to nine combatant ships that are tasked with 
screening the carrier. The carrier and her escorts are 
required to stay at sea for long periods of time. With the 
exception of major equipment failure or battle damage, the 
only limitation to the CVBG’s ability to stay at sea is the 
need for resupply. This need is met through the use of ships 
assigned to the Combat Logistics Force (CLF). These ships are 
specifically designed to deliver food, fuel, ordnance, and 
Spare parts to the CVBG from shore support bases. 

When the logistic ship arrives in the vicinity of the 
CVBG, it finds a formation of ships that are assigned to 
specific sectors for the mutual defense of the battle group. 
When the logistic ship is ready to resupply the screening 
ships, she has three primary tactics from which to choose. 

1. Gas Station 

This method of resupply has the logistic ship stay in 
a position, normally near the middle of the battle group, and 


requires the screening ships to leave their assigned sectors 


and rendezvous at the position of the logistic ship. When the 
screening ship has been resupplied, it returns to its assigned 
sector. 
2. Delivery Boy 
This tactic requires the logistic ship travel to the 
screening ships and perform replenishment. There is no 
requirement for the screening ships to leave their assigned 
sectors during the replenishment process. 
3. Circuit Rider 
This tactic can be viewed as a combination of the 
Delivery Boy and Gas Station tactics. Each screening ship is 
given a sector where replenishment can take place. If each 
ship’s sector only contains the position of the logistic ship, 
this tactic reduces to the Gas Station tactic. If each 
screening ship’s replenishment sector only includes it’s own 
on-station position, this tactic reduces to the Delivery Boy 
Cactic. Under normal conditions, both the logistic ship and 
the screening ship move from their on-station positions and 
meet for replenishment at a mutually convenient rendezvous 


Llocatacne 


B. DISCUSSION 

Figure 1 illustrates the three tactic for a four ship 
Formation. Depending on the operational situation, the Gas 
Station and Delivery Boy tactic have some potential drawbacks. 


In the Gas Station method, a ship that is assigned to a sector 
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Figure 1 Graphic illustration of underway replenishment 
tactics 


for defensive purposes is taken off station for replenishment. 
By vacating its sector, the screening ship is removing its 
defensive ability from the CVBG. In a hostile environment 
this would be undesirable, particularly if an alternative 
tactic is available. The Delivery Boy method is likewise 
deficient in that it may require the relatively defenseless 
logistic ship to travel alone outside the protected area of 
the battle group to rendezvous with a single ship, exposing 
the logistic ship to unnecessary and perhaps unacceptable 


risks. The Circuit Rider tactic combines these two tactics 


and can therefore potentially eliminate some of the drawbacks 


mentioned above. 


C. RELATED UNDERWAY REPLENISHMENT STUDIES 

Hardgrave (1989) studied the feasibility of replenishing 
a CVBG using the tactics discussed above, simplifying the 
problem by using point locations for each of the screening 
ships. Zabarouskas (1992) extended the work of Hardgrave by 
creating a distinct group of locations that represent the 
sector that a screening ship is assigned, and developed an 
optimal branch and bound algorithm to determine the minimum 
time to replenish all ships. Both of these studies 
formulated the replenishment scheduling problem as a Traveling 
Salesman Problem (TSP), (see Wu (1992) for a comprehensive 
review of modelling underway replenishment using the TSP.) 

In Wu (1992), models were developed to address scenarios 
where there is insufficient time to service all the ships 
needing replenishment. In these scenarios a combat value is 
assigned to each ship requiring replenishment. The optimal 
solution results in a schedule that restores the largest 
combat value to the battle group without exceeding the maximum 
time allowed for replenishment. The computational study of Wu 
(1992) demonstrates that solving these problems can require 
excessive CPU times using commercial integer programming 


software. 


D. THE GENERALIZED ORIENTEERING PROBLEM 

Orienteering (Golden, Wang, Liu, 1988) is a sport that 
requires participants to find their way through a course toa 
series of ‘control points’ where they collect a score when 
they arrive. Participants leave from a designated starting 
point and must finish at the end point, but are free to make 
their own paths between the control points. The objective is 
to collect the highest score while reaching the end point 
within the prescribed time. 

In military situations, time constraints are almost always 
present and need to be considered. The Orienteering Problem, 
therefore, more closely resembles the scenario facing the 
decision maker scheduling underway replenishment. 

The problem being addressed in this thesis is the 
Generalized Orienteering Problem (see Wu (1992)) because each 
‘control point’ or ship sector is represented by a discrete 
group of points. All points in each sector are assumed to 
have the same reward, and that reward can only be obtained 


Smce . 


E. DYNAMIC PROGRAMMING 

Dynamic programming, first introduced by Bellman (1957), 
is an optimization technique in which decisions are made in 
stages. The technique implicitly enumerates all possible 
solutions. If the problem can be formulated such that several 


partial solutions can be grouped ina single state, then those 


partial solutions that are inferior can be eliminated from 
future consideration. The process progresses from stage to 
stage eliminating dominated partial solutions until only the 


optimal solution to the problem remains. 


F. OUTLINE 

This thesis describes a dynamic programming procedure that 
under specific conditions meets the need reported by Wu (1992) 
for a specialized algorithm to solve the Generalized 
Orienteering Problem as applied to underway replenishment 
scheduling. 

Chapter II describes the problem including the necessary 
assumptions, network structure, and dynamic programming 
formulation. Chapter III details the solution procedure with 
reference to data structures, traversal techniques and memory 
requirements. In Chapter IV, the procedure is used to solve 
a set of test problems. A description of these problems and 
the performance of the algorithm are reported. Chapter V 
concludes with a summary of results and topics for future 


research. 


». 


II. PROBLEM FORMULATION 


ASSUMPTIONS 


With the following assumptions the underway replenishment 


scheduling problem can be modelled as a Generalized 


Orienteering Problem. 


The time needed to replenish each ship is constant 
regardless of the order the ships are serviced. While 
this is not strictly true, the differential time spent 
replenishing is insignificant compared to the transit 
times. 


Ships move directly from their location to the rendezvous 
point. In practice there is the need to avoid the other 
ships in formation. 


The logistics ship starts and ends its path at a point 
near the center of the formation. 


The battle group maintains its base course and speed 
throughout the time allotted for replenishment. 


Ships engaged in replenishment maintain their relative 
position with the battle group by matching its course and 
speed. 


The optimal path is that path which adds the most combat 
value to the battle group while allowing time for the 
logistic ship to return to its position near the center of 
the formation. 

NETWORK STRUCTURE 


The dynamic programming formulation of the Orienteering 


Problem as applied to scheduling underway replenishment can be 


viewed as a directed graph D= {N, A}. Figure 2 illustrates 


this graph for a scheduling problem where n, the number of 


STAGE 





Figure 2 Graph depicting network structure of three node 
underway replenishment scheduling problem. 


ships, is three and each ship is assigned a single rendezvous 
location. The set of nodes N, contains the possible states of 
the system. A state is a pair of elements where the first 
element contains the set of locations that were previously 
visited, and the second element contains the current location 
of the logistics ship. For example, the node (1,2-3) in 
Figure 2 indicates a state that has visited locations 1 and 2 
and is currently at location 3. 

The set of arcs, A, represent permissible moves from one 
State to the next, and have associated with them the 
applicable time of the move. Note that the graph contains no 
cycles, that is to say there are some locations that cannot be 
visited from certain states. By excluding arcs that would 
create cycles, it 1S guaranteed that a traversal of this graph 
from b to any leaf state yields a feasible solution to the 
Orienteering Problem, and that the shortest path traversal 
from b to any state produces the optimal path to that state. 
Also note that any state at stage k>2 can be arrived at 
through several states in stage k-1. It is this feature of 
the graph that eliminates the need to explicitly enumerate 
every permutation of locations to arrive at an optimal 
solution. At any state it is only necessary to further 
develop the best path that ended in that state since all 
others are inferior and cannot be part of an optimal solution. 

The graph represented in Figure 2 has one stage for each 


ship in the formation, plus one additional stage. Stage one 


represents the possible partial paths with one ship, stage two 
the partial paths with two ships and the complete one ship 
paths, and so on. For example, stage 3 contains the partial 
3 ship paths (1,2-3), (1,3-2), (2,3-1), and the completed 2 
ship paths (1,2-e), (1,3-e), (2,3-e). 

It is possible to group several paths into a single state, 
note that paths 1-2-3 and2-i1- 3 are grouped together 
in the state labeled 1,2 - 3. This is done because for these 
two paths only the shorter of the two can be part of an 
optimal tour. 

The number of states, N,, in any stage two through n can 


be calculated using equation 1. 


N,= (je04) (n-k+2) (1) 


The total number of states in the graph can be calculated 


using equation 2. 


: (2) 
Neotal= (n+2) 2" : 


The total number of states represents the minimum amount of 
information needed to guarantee an optimal solution to the 


problem. 
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C. DISCUSSION OF THE GENERALIZED ORIENTEERING PROBLEM 

The network structure presented above can be easily 
modified for the Generalized Orienteering Problem where each 
ship is allowed to have several potential rendezvous 
locations. In the case of the single location Orienteering 
Problem, a cycle was defined to be a path that returned to the 
same location a second time. In the generalized case that 
principle is extended to prevent the path from returning to 
the same ship a second time, regardless of the ship’s 
location. The number of states in stages, two through n of 


the generalized problem is calculated with equation 3. 


Newey) (Remy Ce) (3) 


Where L represents the number of locations for each ship. 
This equation is further generalized for the case where not 
all ships have the same number of multiple locations in 
Chapter IV. Finally, the total number of states in the graph 


for the generalized problem is computed with equation 4. 
Ne, ., = \(aike+ 2)} 22-4 (4) 
D. RECURSIVE RELATIONSHIP BETWEEN STAGES 
The fact that solutions to problems formulated as dynamic 


programs are made in stages leads to a search for a 


relationship between stages. These relationships are referred 


al 


to as recursive equations. The existence of a recursive 
equation highlights the fact that decisions are made in stages 
and the direction to move in getting from one stage to the 
next has nothing to do with how the system arrived at the 
initial stage. This fact can be stated succinctly as 
Bellman’s Principle of Optimality: 

An optimal policy has the property that whatever the 

initial state and initial decisions are, the remaining 

decisions must constitute an optimal policy with regard to 

the state resulting from the first decision. (Bellman 

157) 
E. DYNAMIC PROGRAMMING FORMULATION 

A mixed integer programming formulation for. the 

Generalized Orienteering Problem is presented in Wu (1992). 
In what follows, the Orienteering Problem is formulated as a 
dynamic program. The problem is viewed as having two 
objective functions. The primary objective ensures that the 
optimal tour collects the largest score possible. The second 
objective breaks ties based on the amount of time required to 
collect the score. For example, given sufficient time 
numerous tours will collect the total score available; the 
optimal tour will be that tour which collects the score in the 


minimum amount of time. This ensures that the solution will 


reduce to the TSP solution when a large amount of time is 


provided. 
b, e Beginning and ending nodes. 
Peco Node indices. 
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f* 


S| 


Stage index. 


An unordered set of nodes, that always contains 
the beginning node b. 


A set of nodes that corresponds to a path ending 
in node i, where i ¢ SS. 

Total time allowed to complete a tour. 

Score received for visiting node i, R(b)=R(e)=0. 


Time required after leaving node 1 to complete 
replenishment a node j. 


Primary objective function; score received by 


visiting the nodes in set S, where, |S|=k. 


Secondary objective function value; minimum time 
required to travel from b to i via all nodes in 
set S. Note that g,(S,i) is only defined when 


|}S|=k and i¢€s 
The maximum score obtainable. 


The minimum time to obtain the maximum possible 
score. 


The recursive equations that link stages together are: 


Cp line, 


ea me De ECye aes |s|- kK-t, 7¢s 
> R(i) Pegs 3) Ss IMAX 
: i1eSK;} 
eye ) 
0 otherwise 


This is clearly the case since the score received for visiting 
the same nodes (set S), regardless of order, is constant. 


The initial conditions for the system are: 


a3 


£, (@,b) 


ll 
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J, (a,b) 


The optimal primary objective function value would be 
f* = max ({ f, (S,e)}, over all possible k and S where | S|=k 
The optimal secondary objective function value would be 
= min {g,(S,e)} over all possible k and S where |S | =k and 


fevo, e) =f* 


This formulation is easily modified for the Generalized 
Orienteering Problem. The only conceptual difference is that 
each node now represents both a ship and location of 
replenishing the ship. This dynamic programming formulation 
serves as the basis for the solution procedure contained in 


Chapter III. 
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IIIT. SOLUTION PROCEDURE 

An optimal solution to the Generalized Orienteering 
Problem provides the minimum time, within an established 
limit, to collect the maximum possible score. This is 
equivalent to finding the shortest path to all states in the 
decision tree shown in Figure 2, and selecting the best path 
based on the total score collected within the established 
time. Evaluating the states in the tree requires a systematic 
search which can be accomplished either in a breadth-first or 


depth-first manner. 


A. BREADTH-FIRST SEARCH 

Figure 3 illustrates the breadth-first search of the 
decision tree. State numbering indicates the order the states 
are evaluated. This is the technique typically used for 
solving dynamic programming problems. The procedure evaluates 
all the states in stage one before moving on to stage two and 
so on. The primary advantage of this procedure is that at 
every stage the state values are known to be optimal, and sub- 
optimal paths are never investigated beyond the point where 
their inferiority is discovered. 

The price paid for this advantage is the space and work 
required to keep track of all the state information at stage 


k-1 while computing the node values in stage k. For instance, 


Lips: 


STAGE 





Figure 3 Breadth-first traversal of Dynamic Programming 
decision tree. 


dS 


a problem with ten ships that each have 16 possible rendezvous 
locations has over 24,000 states in stage 5 that must be 
stored and managed in order to produce the over 17,000 states 
in stage 6. This served as the motivation for conducting a 


depth-first search. 


B. DEPTH-FIRST SEARCH 

The depth-first search moves quickly down the graph, 
always looking for the shortest feasible arc. If the system 
moves into a feasible state, a label is created recording the 
required time and collected score. When the system reaches 
the end node or an infeasible state, the system moves back up 
the decision tree and investigates the arcs from the labeled 
state furthest down the tree, breaking ties based on time 
required. As the procedure continues, the system can arrive 
back at a state that has already been labeled. If the new 
value at that state is inferior to the old value, the path can 
be discarded since the completion of that path cannot be 
optimal. This procedure continues until every feasible state 
in the decision tree is labeled. 

Figure 4 is included to help demonstrate this process 
where the state numbers indicate the order of labelling. The 
algorithm determines order based on required time, but in this 
example the state to the left is arbitrarily chosen when a 
choice is made. Note that states in stages 3 and 4 are 


labelled more than once. While this can happen, it should not 


IO 


STAGE 


cosy 


4,8, 14, 18, 24, 28 


Figure 4 Depth-first Traversal of Dynamic Programming 
Decision Tree. 


Ets: 


happen often since the first label created at any state will 
have the same time label as the nearest-neighbor heuristic 
solution to that state. To the extent that the nearest- 
neighbor heuristic provides ‘good’ paths, the first label 
created at any state can be expected to be superior to future 
labels that are generated. 

Dynamic programming procedures are often ineffective for 
solving large combinatorial problems, like the Generalized 
Orienteering Problem, because of the enormous amount of state 
information that must be simultaneously managed to derive a 
Souution. The depth-first traversal of the decision tree 
greatly reduces the amount of information being managed. For 
the ten ship example described above, no more than 880 labels 
must be managed at any time. The minimum time required to 
each state must still be stored, but this storage requires no 
processing. Again there is a trade-off, the convenience of 
reducing the amount of information being processed is paid for 
by some amount of repeated work. The fact that the nearest 
neighbor heuristic provides ‘good’ solutions to shortest path 
problems indicates that the additional work may justify the 
trade-off. The depth-first search also has the advantage that 
premature termination of the procedure will produce feasible 


solutions. 
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C. ALGORITHM DESCRIPTION 

The following Depth-First Algorithm is described for 
solving problems where each ship is allowed a_e single 
rendezvous location, though it can be easily generalized to 


the multiple location scenario. Let t(i,j) be the time needed 


to transit from node i to node j for i = 1,52, ..., 2 andi 
1, 2, ..., n where n is the total number of nodes. Let t(i,1i1) 
=o for ie= 1, 2, ..., nm. A tour is a route that begqinstean 


the start node and arrives at the end node through some or all 
of the possible nodes. A path is a route that begins at the 
start node and ends at a node other than the designated end 
node. Let R(i) be the value added when the system arrives at 
state 1. The Depth-First Algorithm requires the use of a list 
of temporary labels to store path data that have not yet been 
investigated, and a BEST matrix that records the required time 
of the best known path to each state. 

The algorithm can be summarized with the following 7 
steps. 


Step 1 


Set the incumbent objective function value to a large time 


and low combat value: OFV = (10?°,0) 


Step 2 


Select the beginning node as the current node. 


Z0 


step 3 


Investigate all nodes adjacent to the current node and 
record on the temporary list: the total time to complete the 
path, total combat value, all nodes in the path where the last 
node and its predecessor are distinguished. This information 


is only recorded if extending the partial path does not: 


® create a cycle. 
® exceed the maximum time allowed for the path. 


® create a path known to be inferior. 


Step 4 


For the states where a temporary label was made, update 
the BEST matrix with the time required to reach each state. 
It 18 unnecessary to record the value at the state since every 
path to a state contains the same nodes and must have the same 
value. 
eieep 5 

Search the temporary list for the label that is 


lexicographically smallest on the list based on: 


® Most arcs in the partial path. 
® Least time to reach the current state. 


@® Highest combat value added. 


Record the information from that label and remove the label 
from the temporary list. If the list is empty the problem is 


solved. 


PXAN 


step 6 


If the label removed from the temporary list is froma 
state that represents a completed path, compare that tour to 
the incumbent solution. The new tour is superior if its value 
is more than the value of the incumbent. If the value is the 
same, the tour that requires less time to complete is 
superior. 

Step 7 
Make the node selected in Step 5 the current node and 


return to Step 3. 


D. COLLECTING THE OPTIMAL PATH 

The depth-first traversal of the decision tree greatly 
Simplifies the collection of the optimal path. By using an 
assignment array, every time a new label is selected the 
assignment array is cleared from the new nodes predecessor 
forward to the end of the last path. When a tour is found 
that is superior to the best tour so far, the assignment array 
is stored as the best tour and the process continues. This 
can be done using the depth-first traversal since the path 
prior to the stage where the decision is made remains 
constant. This characteristic is generally not true with the 
breadth-first traversal, and would necessitate the use of a 


more elaborate path collection scheme. 
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E. STATE REPRESENTATION AND BEST MATRIX 

The BEST matrix keeps track of the shortest path to all 
possible states. Because the state is defined as a pair, the 
first element being the set of all nodes included in the path 
so far, and the second element being the last node visited, 
the BEST matrix is two dimensional. The set of all nodes in 
the path can be represented by a binomial expansion of the set 
into a single integer. That is, the set {1, 2, 5, 9} can be 
represented as 2°+21+24+28 or 275. The time required to 
complete the best path that visits nodes 1, 2, 5, and, 9 and 
ends at node 2 can be found as element BEST (275,2). 

This representation of the state has the additional 
advantage of providing a useful method of checking whether 
visiting a node will introduce a cycle in the path. A 
‘hashing function’ ® is defined to compare the set of nodes 
with the candidate node, and determine if the addition of the 
candidate node will introduce a cycle. For example, consider 
the addition of a node corresponding to ship 5 to a path 


containing ships {1, 2, 5, 9}, 


{1, 2, 5, 9} => 275 100010011 
{5} => 16 and 000010000 
275 © 5 000010000 => 16 


A one contained in both number 5 locations indicate a common 
element. This function can be accomplished with a single 


operation in many computer languages. In FORTRAN 77 the 


Zs 


‘integer and’ operation will compare bit strings of two 


integers and return an integer indicating which bits match. 


F. THE GENERALIZED PROBLEM 

The previously described algorithm only needs to be 
Slightly altered to solve the Generalized Orienteering 
Problem. Because many nodes can represent a single ship 
additional information is required to determine which ships 
have been visited. A state changes to a pair where the first 
element is the set of ships that are part of the current path, 


and the second element is the node (ship and location) where 





the path ends. Figure 5 illustrates two paths, A and B, that 
visit the same ships. Ship numbers are represented by Roman 
numerals. Node numbers are represented with Arabic numerals. 

Note that the paths have only their beginning and@ema 
nodes in common, but the longer path would be dominated by the 
shorter path because they visit the same ships, end at the 
same node and thus obtain the same score. 

Note that the total number of states grows exponentially 
by number of sectors and geometrically by number of nodes. 
This characteristic 1S very attractive for Underway 
Replenishment Scheduling problems where the number of ships is 
likely to be small but better answers are achieved by allowing 


numerous locations in each sector. 
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Figure 5 Illustration of dominated path in Generalized 
Orienteering Problem. 
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IV. TEST AND EVALUATION 

The Depth-First Algorithm was programmed in FORTRAN 77 
with computational results collected from a SUN/SPARC2 
workstation. To evaluate the algorithm performance in 
scenarios likely to be faced by decision makers scheduling 
underway replenishment, it was tested against a set of 630 
randomly generated formations. The 630 formations were 
grouped into 21 classes of 30 formations each. The classes 
were divided according to the number of ships, the number of 
ships with multiple locations, and the number of locations 
those ships had. 

The cost matrix (time to transit between nodes) is 
asymmetric due to the movement of the formation. The data for 
each problem was processed to eliminate locations that could 
not be part of the solution. Table I shows in more detail 
the characteristics of the problem classes. For a complete 
description of how the test problems were generated see 
Zabarouskas (1992). 

These test problems were derived for both the Delivery Boy 
and Circuit Rider tactics. For the Delivery Boy tactic, 
multiple ship locations represent an on-station sector, and 
the time between locations represents the replenishment ships 
transit time and the subsequent replenishment of the combatant 


ship. To be valid problems for the Circuit Rider tactic, the 
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time between locations represents the time needed for both 
combatant and replenishment ships to rendezvous and complete 


replenishment. 


A. THE EFFECT OF NODE SCORES 

It was decided to test the algorithm’s performance for 
varied node scores. Each of the 630 problems were run with 
scores randomly generated between one and 100, and then again 
with scores randomly generated between 100 and 200. It was 
hypothesized that there might be some advantage to solving the 
problems with the one to 100 range because of the possibility 
of a node being worth many times that of another node (for 
instance, a node with a score of 80 would be twenty times more 
valuable than a node with a score of four.) This relationship 
between node scores would not be possible in the 100 - 200 
range since, at best, one node could be worth only twice the 
score of another. 

The results of the two test sets showed that the same 
number of labels were created and thus the same amount of CPU 
time was used to solve each set of problems. The paths were 
different when the maximum time allowed was insufficient to 
visit every node, but the same amount of work was required to 
generate the optimal solution. The reason for this is that 
the algorithm seeks the best path to each state based only on 
the time required to reach that state. This is true because 


every path to a state has exactly the same value. 


Ze 


Table I CHARACTERISTICS OF TEST PROBLEM SETS. 


# of ships # of EOtal af avg. fuer 
# of with multiple multiple OL locations 
class ships locations locations locations removed 
BE 6 0 na 6 na 
2 6 3 ile 2.07 
3 6 5 21. e) (Se: 
4 6 2 30 Saa7 
5 6 3. 9 46 8.40 
6 6 é 16 Sil sie [0) 7/ 
7 6 5 16 81 TO 74 
8 8 0 na 8 na 
S 8 4 20 1S, 
10 8 7 29 2.47 
Bags 8 a 40 ZOS 
a2 8 a 9 64 ee 
13 8 “ 16 68 2.310 
14 8 a) dG | Sa 
slo. 10 0 na 10 na 
16 1 5 4 ZS S00 
sa 10 g 2) 7) Liga 
18 10 5 40 Lae 
19 10 9 9 82 130 
20 10 5 16 35 2.0m 
Zi 10 9 16 145 Zeer] 


28 


This result waS considered positive in that’ the 
performance of the algorithm would not vary depending on the 
scoring scale used by the decision maker. On the other hand, 
it highlighted the fact that the algorithm does not take 
advantage of the information contained in the score when 


solving the problem. 


B. MEASURING THE EFFECTIVENESS OF THE ALGORITHM 

To thoroughly determine the effectiveness of the 
algorithm, it was necessary to test it not just with a random 
set of formations but also with a variety of time constraints. 
With this in mind each of the 630 problems were solved four 
times. The first time, the maximum time allowed for a tour 
was large enough to ensure that a tour was found that visited 
all the ships. This tour is equivalent to the solution of the 
TSP problem for that formation. The TSP time served as a 
baseline used to generate the other three problems for the 
formation. The maximum time for these problems were set at 
100%, 75%, and 50% of the time required to complete the TSP 
tour. Problems with a maximum time greater than 100 % of the 
TSP time were not considered since the algorithm reduces TMAX 
whenever a solution that visits every sector is found. This 
adjustment results in times that vary insignificantly from the 


results reported for 100% of the TSP time. 
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Table II ALGORITHM RUN-TIME IN CPU SECONDS. 


100% of TSP 75% of TSP 50% of TSP 


time time time 
class mean st dev mean Sst dev mean st dev 
1 0.029 0.005 0.018 0.005 OP oll 0.004 
2 OS07> Ore Or a 0.045 OF; Os 0.022 0.008 
3 0.143 0.042 Oo, Uae 02026 0.034 Or,015 
4 OF DLS 0.085 0.132 0.054 0.046 0.020 
5 O5on 0.224 0.286 Oe iZo 0.094 0.038 
6 0.463 011902). (0m272 0.129 0.106 0.058 
7 1.673 0.632 0.964 0.394 0.306 Get 30 
8 0.230 0.024 0.130 0.018 0.041 Oho 
9 0.993 0.247 0.540 0.164 0.169 0.084 
10 2099 OF 502 i Oye "al 0.299 O2zZ73 O2127 
ital 2 3m 0.810 1.520 0.483 0.478 0.198 
2 8.884 2.851 4.755 i Wpasye 1.203 0.492 
ie 7 <lOa! 1.875 3.714 1.152 1.105 0.533 
14 26.262 6.915 13.422 3.847 3.354 1.394 
15 1.907 Oo2 14a 0.243 0.283 Gsikoa 
16 8.900 1.594 4.555 1.126 1.084 0.472 
7 21.983 4.599 11 o 7 Aen 8 eee Di Age 1.186 
18 306032 ~<GQDA00eNA4!. 203me 2Mchc 3.142 1.394 
19 101.975 21.103 49.882. 11.180 410.247; 
20 75.013 18.324 36.885 10.060 #7.215 3.139 
Da 280.663 66.892 133.087 35.941 24.935 12.352 
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1. CPU Requirements 

The CPU time required to solve each of these problems 
was recorded. Table II contains the means and standard 
deviations computed for each class. Note that in each class 
the reduction in CPU time is greater than the percentage 
change in TMAX that caused it. The reduction increases as the 
number of ships and the number of locations increase. 
Additionally, the table shows a relatively stable record of 
performance across all classes of problems, as evidenced by 
Standard deviations that are generally 20 to 30 percent of the 
mean CPU time. 

2. Performance Based On Labels Generated 

The second measure of the algorithm’s effectiveness 
relates to the amount of work that is actually being done in 
order to solve the problem. The algorithm creates a label 
every time a path to a state is found for the first time, and 
any time a path is found that is superior to the label already 
created for that state. By comparing the number of times a 
label is made and the number of states in the system, the 
performance when the maximum amount of time to complete a tour 
varies can be made. 

The removal of dominated locations, and the fact that 
not all ships have multiple locations, changes the way the 
total number of states in the system is calculated. Equation 


4 from Chapter II becomes equation 5. 


at 


COC #2 yori la) ) tame ae Coy) 


total — 


Where m is the number of ships with multiple locations and r 
is the number of locations removed by dominance. This formula 
is for problems where there is sufficient time to visit every 
ship. In scenarios where the maximum time allowed constrains 
the solution, the number of feasible states is a function of 
the formation and cannot be computed deterministicly. In 
these cases the number of feasible states is kept track of in 
the program. 

Table III shows the average number of states for each 
class and the average number of labels that were created 
before the shortest path to each state was found. The table 
shows a reduction in the number of states similar to the 
reduction that occurred in CPU requirements. The table also 
shows that an increase in the number of locations increases 
the number of times labels are written over. The overwrites 
are an estimate of the extra work needed to use the depth- 
first traversal of the decision tree. This extra work grows 
at a distressing pace for the larger problems, but it is these 
problems that would require the breadth-first traversal to 
manage a large number of states. Additionally, the CPU 
requirements for these problems remains modest despite the 


additional work being done. 
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Table III WORK REQUIRED TO OBTAIN SOLUTIONS 
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V. CONCLUSIONS 


A. ALGORITHM PERFORMANCE 

The computational results of the algorithm showed 
excellent performance over a wide range of scenarios. These 
results, however, are probably more a function of the improved 
network structure than the traversal technique chosen. The 
depth-first traversal created many more labels in the multiple 
location scenarios than would have been created by a breadth- 
first traversal. But despite this, the algorithm provided in 
this thesis is an excellent solver for the Delivery Boy, and 


Circuit Rider tactics, with or without a time constraint. 


B. RECOMMENDATION FOR FUTURE RESEARCH 

Clearly, there is reason to think an algorithm that 
conducts a breadth-first search of the described decision tree 
may improve on these results. Furthermore, any additional 
means of removing nodes or arcs prior to the implementation of 
the algorithm is likely to improve performance. 

Future research should also include the relaxation of the 
assumption requiring the formation to maintain course and 
speed during the entire replenishment cycle, and the 
assumption that the replenishing ships will be able to match 


the course and speed of the battle group. 
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C. SUMMARY 

This thesis was undertaken to develop a specialized 
solution procedure to schedule underway replenishment without 
the use of commercial integer programming software. The 
reported algorithm selects the optimal set of ships to 
replenish, establishes the correct order, and finds the 
optimal rendezvous locations to conduct the replenishment. 
The algorithm was tested against a set of randomly generated 
formations, and was determined to provide optimal solutions 
quickly enough to meet the needs of underway replenishment 


schedulers. 
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